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Q^^ , Abstract 

C/5 . A niean field feedback artificial neural network algorithm is developed and explored for the set covering 

J-^ ' problem. A convenient encoding of the inequality constraints is achieved by means of a multilinear 

penalty function. An approximate energy minimum is obtained by iterating a set of mean field equations, 
in combination with annealing. The approach is numerically tested against a set of publicly available test 
problems with sizes ranging up to 5 x 10'^ rows and 10^ columns. When comparing the performance with 
C^ ' exact results for sizes where these are available, the approach yields results within a few percent from 

the optimal solutions. Comparisons with other approximate methods also come out well, in particular 
given the very low CPU consumption required - typically a few seconds. Arbitrary problems can be 
processed using the algorithm via a public domain server. 
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1 Introduction 



The set covering problem (SCP) is a well known NP-hard combinatorial optimization problem, which 
represents many real-world resource allocation problems. Exact solutions can be obtained by e.g. a 
branch-and-bound approach for modestly sized problems. For larger problems various approximative 
schemes have been suggested (see e.g. [|], ||)- In this paper we develop a novel approach based on 
feedback Artificial Neural Networks (ANN), derived from the mean field approximation to the thermo- 
dynamics of spin systems. 

ANN is a computer paradigm that has gained a lot of attention during the last 5-10 years. Most of 
the activities have been directed towards feed-forward architectures for pattern recognition or function 
approximation. ANN, in particular feedback networks, can also be used for difficult combinatorial opti- 
mization problems (e.g. [£|-|l^). Here ANN introduces a new method that, in contrast to most existing 
search and heuristics techniques, is not based on exploratory search to find the optimal configuration. 
Rather, the neural units find their way in a fuzzy manner through an interpolating, continuous space 
towards good solutions. There is a close connection between feedback ANN and spin systems in statis- 
tical physics. Consequently, many mathematical tools used for dealing with spin systems can be applied 
to feedback ANN. Two steps are involved when using ANN for combinatorial optimization: 

1. Map the problem onto an energy function, e.g. 

E{S)^ -'^w.js^sj (1) 

where S = {si; i = \ . . . N} is a set of binary spin variables Si G {0, 1}, representing the elementary 
choices involved in minimizing E, while the weights Wij encode the costs and constraints. 

2. To find configurations with low E, iterate the mean field (MF) equations 

where T is a fictitious temperature while V — {wi}, where w^ G [0, 1] represents the thermal average 
(sj)t, and allows for a probabilistic interpretation. 



Eqs. (y,^ only represent one example. More elaborate encodings have been considered, e.g. based on 
Potts spins allowing for more general basic decisions elements than simple binary ones Pq . A propagator 
formalism based on Potts neurons has been developed for handling topological complications in e.g. 
routing problems ||] g . 

The ANN approach for SCP that we develop here differs from the one that was successfully used for the 
somewhat related knapsack problem in [nSl E2|, in particular with respect to encoding the constraints. 
Whereas a non-linear step- function was used in M, n3|, we will here use a multilinear penalty, which 
in addition to being theoretically more appealing, also appears to be very efficient. Furthermore, an 
automatic procedure for setting the relevant T-scale is devised. 

The algorithm is extensively tested against a set of publicly available benchmark problems p4| with sizes 
(rows X columns) ranging from 200x1000 to 5000x10®. The approach yields results, typically within a 



few percent from the exact optimal solutions, for sizes where these are available. Comparisons with 
other approximate methods also come out well. The algorithm is extremely rapid - the typical CPU 
demand is only a few seconds (on a 400MHz Pentium II). 

A public domain WWW server has been set up, where arbitrary problems can be solved interactively. 

This paper is organized as follows: In Sect. 2 we define the set covering problem, and in Sect. 3 
we describe its encoding in terms of a neural network energy function and discuss the mean field 
treatment. Sect. 4 contains numerical explorations and comparisons. A brief summary is given in 
Sect. 5. Appendices A and B contain a derivation of the mean field equations, and some algorithmic 
implementation issues, respectively. Tables from the numerical explorations are found in Appendix 
C, while Appendix D contains pointers and instructions for the WWW server. It should be stressed 
that this paper is self-contained - no prior knowledge of feedback neural networks or the mean field 
approximation is necessary. 



2 The Set Covering Problem 



The set covering problem (SCP) is the problem of finding a subset of the columns of an MyiN zero-one 
matrix A = {ati G {0, 1}; fc = 1, . . . , M; i = 1, . . . , N} that covers all rows at a minimum cost, based on 
a set of column costs {ci;i — 1, . . . , N}. SCP is conveniently described using a set of binary variables, 
iS = {si e {0, 1}; i = 1, . . . , N}. More precisely, SCP is defined as follows: 

N 

Minimize 2, c^Si (3) 

i=l 

N 

subject to 2_] o,kiSi > 1, fc = 1, . . . , M (4) 

with Sje{0,l} i = l,...,7V (5) 

Eq. (H) states that at least one column must cover a particular row. The special case where all costs c^ 
are equal is called the unicost SCP. There is subclass of SCPs that has a nice graphical interpretation: 
If the zero-one matrix A has the property that each row contains exactly two I's then we can interpret 
A as the vertex-edge matrix of a graph with TV vertices and M edges. The unicost SCP is then a vertex 
covering problem, where the task is to find the minimal number of vertices that covers all edges of the 
graph. SCPs (including weighted vertex covering) are NP-hard combinatorial optimization problems. 
If the inequalities of Eq. (|j) are replaced by equalities, one has the set partition problem (SPP). Both 
SCP and SPP have numerous resource allocation applications. 

The following quantities will be used later: 

Density: p = j^ ^ aki (6) 

ki 
M 

Column Sums: Ni — \^ o.ki (7) 

k 



N 



Row Sums: Mk = 2, o,ki (8) 



3 The Mean Field Approach 

3.1 The Energy Function for SCP 

We start by mapping the SC problem of Eqs. (||, |[ H) onto a spin energy function E{S) (step 1 in the 
introduction) , 

N M N 

^(5)=^c,s, + a^J|(l-afc,s,) (9) 

i=l k=li=l 

The first term yields the total cost and the second one represents the covering constraint of Eq. (Eh by 
imposing a penalty if a row is not covered by any column. 

The constraint term is a multilinear polynomial in the spin variables s^, i.e. it is a linear combination 
of products S1S2 ■ ■ ■ sk of distinct spins. This is attractive from a theoretical point of view, and the 
differentiability of E enables a more quantitative analysis of the dynamics of the mean field algorithm. 

An alternative would be to implement the inequality constraints using a piecewise linear function |13|| , 

M / N \ 



Y,^[l-Y,aM-%\ (10) 



a 

k=l 



with 0(a;) — xQ(x) = x if x > and otherwise. This yields a non-differentiable energy function and 
generally an inferior performance as compared to the polynomial representation (0). 



3.2 The Statistical Mechanics Framework 

The next step is to minimize E{S). Using some local updating rule will most often yield a local minimum 
close to the starting point, with poor solutions as a result. Simulated annealing (SA) |8] is one way of 
escaping from local minima since it allows for uphill moves in E. In SA a sequence of configurations S 
is generated according to a stochastic algorithm, such as to emulate the probability distribution 

P{S) = — r^.— (11) 

^ ' 'yg-E{S')/T ^ ' 

S' 

where the sum runs over all possible configurations S' . The parameter T (temperature) acts as a noise 
parameter. For large T the system will fluctuate heavily since P{S) is very flat. For the SCP this 
implies that the sequence contains mostly poor and infeasible solutions. On the other hand, for a small 
T, P{S) will be narrow, and the sequence will be strongly dependent upon the initial configuration and 
contain configurations only from a small neighborhood around the initial point. In SA one generates 
configurations while lowering T (annealing), thereby diminishing the risk of ending up in a suboptimal 



local minimuni. This is quite CPU-consuming, since one has to generate many configurations for each 
temperature following a careful annealing schedule (typically Tk = ro/log(l + k) for some To) in order 
to be certain to find the global minimum. 

In the mean field (MF) approach the costly stochastic SA is approximated by a deterministic process. 
MF also contains an annealing procedure. The original binary variables Si are replaced by continuous 
mean field variables Vi G [0,1], with a dynamics given by iteratively solving of the MF equations for 
each T. 

An additional advantage of the MF approach is that the continuous mean field variables can evolve in 
a space not accessible to the original variables. The intermediate configurations at non-zero T have a 
natural probabilistic interpretation. 



3.3 Mean Field Theory Equations 

Our objective is now to minimize E using the MF method. The binary spin variables s,; are replaced 
by mean field variables Wi, representing solutions to the MF equations (a derivation of these is given in 
Appendix A). 

v, = l/(l + cxpf^AE,(v^^^))), i = l,...,N (12) 

where 

AeJv^'A =e(v^'\v^ = i]-e(v^'\v, = 0] = dE/dv, (13) 

The set V^*' denotes the complementary set {vj,j ^ i}. From Eq. (pi) we get 



M iV 






The MF equations are solved iteratively while annealing in T. It should be noted that the equation for Vi 
does not contain any feedback, i.e no explicit dependence on Vi itself. This yields a smooth convergence, 
and it is usually only necessary with a few iterations at each value of T. What remains to be specified 
are the parameters a and T. The latter will be discussed next and we return to the choice of a in 
connection with the numerical explorations in Sect. 4. 



3.4 Critical Temperatures 

In the limit of high temperatures, T —^ oo, the MF variables {vi} will, under the dynamics defined 
by iteration of Eq. (p^), converge to a trivial symmetric fixed point with Vi = 1/(1 + cxp(O)) = i, 
corresponding to no decision taken. At a finite but high T, the corresponding fixed point will typically 
deviate slightly from the symmetric point. 

For many problems, a bifurcation occurs (indicative of a transition from a disordered phase to an 
ordered one) at a critical temperature T^, where the trivial fixed point loses stability and other fixed 
points emerge, which as T — > converge towards definitive candidate solutions to the problem, in terms 



oivi € {0, 1}. For some problems, a cascade of bifurcations occur, each at a distinct critical temperature, 
but in the typical case there is a single bifurcation. 

It is then of interest to estimate of the position of Tc, which defines a suitable starting point for the MF 
algorithm. Such an estimate can be obtained by means of a linear stability analysis for the dynamics 
close to the fixed point. For the special case of a symmetric unicost SCP with constant row and column 
sums for the matrix A, there will be a sharp transition around 

Tc « ap^M2-P^ (15) 

where p is defined in (|6|). 

For non-unicost problems, Tc is harder to estimate, and there might even be no bifurcation at all. For 
such problems, a suitable initial T is instead determined by means of a fast preliminary run of the 
algorithm (see below). 



4 Numerical Explorations 

4.1 Implementation Details 

The annealing schedule for T and the value of the constraint parameter a have to be determined before 
we can run the algorithm. The former is accomplished by a geometric decrease of T, 

Tt+i = kTt (16) 

where k is set to 0.80 and Tq is determined by a fast prerun of the algorithm (see below). The number 
of iterations of Eqs. (O) for each value of T is not fixed; T is lowered only when all vi have converged. 

In order to ensure a valid solution at low T, the size of the constraint term in Eq. (nlh must be larger than 
the largest cost Cmax that is part of the solution. Using a too large a will however reduce the solution 
quality since E is then dominated by the covering constraint. Our choice of a therefore depends on 
Cmax- Ideally, M/{pM) = 1/p columns would suffice to cover each row of A. If we further optimistically 
assume that the 1/p smallest costs can be chosen for the solution, Cmax can easily be found. However, 
for most problems we need more than 1/p columns, which makes it difficult to estimate Cmax except for 
unicost problems where all column costs are equal. 

To determine (an approximate) Cmax for a non-unicost SCP, we perform a fast prerun with a smaller 
annealing factor k = 0.65, and with a = 1.01. From this prerun one can also obtain an estimate the 
critical temperature Tc as the T where the saturation (defined below) deviates from 0. The second run 
is then initated at Tq — 2Tc, thereby avoiding unnecessary updates of Vi at high T. 

This procedure for setting a also requires rescaling of the costs; for all problems we set Ci ^r Cij maxj (cj ) . 
See Table for a summary of the parameters used. 

The evolution of the MF variables \yi\ is conveniently monitored by the saturation S, 

^ = ]^E(^»-l/2)' (17) 



Unicost SCP Non-unicost SCP 
Prerun Second run 



k 


0.80 


a 


0.5 


To 


50 



0.65 


0.8 


1.01 


i-05*c;;„^ 


50 


2*Tc 



Table 1: Summary of the parameters k, a and To used in the algorithm. 



A completely "undecided" configuration, S = 0, means that every Vi has the value 1/2. During the 
annealing process, as the Vi approach either 1 or 0, S converges to 1. The transition between S = 
and S = 1 is usually smooth for a generic SCP. However, when a bifurcation is encountered (see above), 
E can change abruptly; this occurs e.g. for unicost SCP where there is no natural ordering among the 
costs {Ci}. 

Fig. n shows the evolution of the mean field variables {vi} and the saturation E for the problems 4. 1 
and cyc09; the latter is a unicost problem (see Appendix C), and as can be seen from Fig. Rp, it clearly 
exhibits a bifurcation. 




100 200 

Iterations 



300 



Figure 1: Evolution of the mean field variables Vi as T is lowered for 4. 1 (a) and cyc09 (b) respectively. 
Also shown is E (Eq. ([l^)). Note that the number of iterations is intentionally large for visualization 
purposes. 



A summary of the algorithm can be found in Appendix B, while Appendix D gives the address of and 
instructions for a WWW server, where the MF algorithm is applied to user-defined SCPs. 



4.2 Numerical Results 

The performance of the algorithm is evaluated using 16 problem sets found in the OR-Library benchmark 
database |1J|. These 16 sets consist of 91 problems, out of which 19 are unicost SCP. The algorithm is 
coded in C and the computations are done on a 400MHz Pentium II PC. The details of the OR-library 



problems are given in Table CI and our results can be found in Tables |C2|, |C3| an d C4 in Appendix C. 
The optimal or currently best known values are taken from refs. W 0, [4|, |7|, |3|, |ll[| . 

For each of the test SCPs, 10 trials of our algorithm are performed. In Tables C3-C4, the best and 
the average costs are listed for each problem. For 10 of the problems our method found the optimal 
solution. The MF results typically are within a few percent of the optimal solutions, as can be seen in 
Table 0. Large relative deviations from optimum are seen in the unicost CLR problems, which appear to 
be difhcult for our approach. However, the optimal (integer) costs for these problems are low (23 - 26); 
this gives a large effect on the relative deviations even for a small change in the found costs. Decreasing 
the obtained costs by unity will change the mean relative deviation from 16% to 10%. It is also important 
to notice that we use a common set of algorithm parameters {a, fc, Tq) for all unicost problems, without 
any parameter optimization for each problem. Another set of parameters might be more advantageous 
for the CLR problems. 



Problem set 

Rel. deviation (%) 


4 
2.1 


5 
2.7 


6 
3.5 


A 
2.2 


B 

1.1 


C 
1.7 


D 

2.9 


E 



Problem set 

Rel. deviation (%) 


NRE 
3.5 


NRF 
4.4 


NRG 
3.1 


NRH 

2.0 


Rail 
6.8 


CYC 
3.7 


CLR 

16 


STS 
2.5 



Table 2: Mean relative deviation from the optimal (or best known) solution. 



In our implementation of the MF algorithm, the time r for a complete update of all variables {vi} scales 
approximately linearly with the number of non-zero entries in A, 

T cx NMp (18) 

This appealing property is feasible due to efficient calculations of AEi in Eq. (nj) that utilizes the 
sparse nature of A. For the total solution time, r should be multiplied by the number of iterations 
needed - empirically around 100, independently of problem size. Fig. p^ shows the mean solution time 
versus NMp. 

Compared to other heuristic approaches, ours does not find the optimal cost as often as e.g the genetic 
algorithm [^ . It is however very competitive with respect to speed. In ref. y nine different approxima- 
tion algorithms were tested on a large number of unicost problems (including the set considered here); 
our approach is comparable to the top ones in both performance and speed. 



5 Summary 

We have developed a mean field feedback neural network approach for solving set covering problems. 
The method is applied to a standard set of benchmark problems available in the OR-library database. 
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Figure 2: Solution time in seconds, referring to a Pentium II 400 MHz computer, as function of the 
number of non-zero entries in the problem matrix A. Left and right figure shows non-unicost and unicost 
SCP, respectively The rail problems are not shown since a slightly different implementation is used for 
these problems (see Appendix B for details). 



The method is also implemented in a public WWW server. 

The inequality constraints involved are conveniently handled by means of a multi- linear penalty function 
that fits nicely into the mean field framework. The bifurcation structure of the mean field dynamics 
involved in the neural network approach is analyzed by means of a linearized dynamics. A simple and 
self-contained derivation of the mean field equations is provided. 

High quality solutions are consistently found throughout a range of problem sizes ranging up to 5 x 10'^ 
rows and 10^ columns for the OR-library problems without having to fine-tune the parameters, with 
a time consumption scaling as the number of non-zero matrix elements. The approach is extremely 
efficient, typically requiring a few seconds on a Pentium II 400 MHz computer. 

The mean field approach to SCP can easily be modified to apply to the related, and more constrained, 
set partitioning problem. One simply has to replace the inequality constraint term with one that handles 
the equality constraint present in the set partitioning problem. 
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Appendix A. Mean Field Approximation 

Here follows for completeness a derivation of the mean field equations (sec e.g. [^). Let E (S) be an 
energy function of a set of binary decision variables (spins) <S = {si\si € {0, 1}, i = 1, ■ ■ • , N}. If we 
assume a Boltzmann probability distribution for the spins, the average {si)T will be given by 



^s,exp(-i?(5)/r) 

_s 

^exp(-i?(5)/T) 



{s.)t = 4^ ■ — — (Al) 



s 
where the sums run over all possible configurations S. We can manipulate this expression to obtain, 



Y, s.exp(-i?(5«,s,)/r 



s E exp(-ii;(5«,s,0/T 



{S^)T = ^ ''""■' (A2) 

^exp(-£;(5)/T) 

5 

where S^^' denotes the set {sj, j ^ i} of all spins but Si. If we now perform the sums over Si in the 
numerator, we get 



5^exp(-£;(5)/T)— \-- 

^ 1 + exp (AKj 

^cxp(-^(5)/r) " \l + cxp(A£:,/T)/y 

where I^E,{S'^'^) = E{S^'\s, = 1) - E{S'^'\s^ = 0) (A4) 



— /T) 
^^)t = ^^ ■ —^ ^ (^— ^TTTTFt) (A3) 



So far there are no approximations, the expression for {si)T has just been rewritten. Eq. ( |A3[ ) states 
that the expectation value of Si is equal to the expectation value of a nonlinear function / of all the 
other spins. The mean field approximation consists of approximating the expectation value (/ (iS'*')) by 
/ ((5*^*')). With Vi denoting (si), and V*-*' the complementary set {vj,j ^ i}, this amounts to making 
the replacement 

\ 1 + exp {AE, {S(i)) /T) / ^ ^ 1 + exp (AE, (V(*)) /T) ^^^^ 

in Eq. ( |A3| ). This results in a set of self-consistency equations for V, the mean field equations 

V, = -. , ,.,, , , , i = 1 ... TV (A6) 

l + cxp(AS, (vM)/r)' ^ ' 

which in general must be solved numerically, e.g. by iteration. 

The mean field approximation often becomes exact in the limit of infinite range interactions where each 
spin variable interacts with all the others. This can be seen from A£'j(iS(')) which then becomes a sum 
of many (approximately) independent random numbers, and a central limit theory can be applied. 
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Appendix B. Algorithm Details 



Here follows a summary of the MF annealing algorithm for finding approximate solutions to set covering 
problems. The procedure presented below is used for all problems in this study except for the large rail 
problems where numerical problems caused by limited machine precision comes into play. The problem 
arises when calculating the product in Eq. (|l4|), which for large problems can contain many factors. A 
work-around is implemented by a simple truncation, 



if V, < 0.05 

Vi otherwise 



This numerical fix is only used when calculating the AEi (V'*-*) in Eq. (|l4|). 
Algorithmic outline of our approach: 



(Bl) 



1. Rescale all weights such that Cj S [0, 1] j — I, . . . ,N 

2. Initiate all Vj's close to 0.5 

(vj £ [0.499, 0.501] uniformly random) 

3. Set a — 1.05 Cmax, or a — 0.5 for unicost problems 

4. Set the temperature T — 2Tc, or T = 50 for unicost prob- 
lems 

5. Randomly (without replacement) select one variable v^ 

6. Update Vk according to Eq. ( Jl^ ) 

7. Repeat points 5.-6. TV times (so that all Vj have been up- 
dated once) 

8. Repeat points 5.-7. until no changes occur 



^N 



(old). 



<0.01) 



(e.g. defined by l/NJ2j \^j 
9. Decrease the temperature, T —^ O.SOT 

10. Repeat 5-9 until S (Eq. (|l|)) is close to 1 
(e.g. E > (iV - 0.5)/iV) 

11. Finally, the mean field solution is given by setting 
Sj = 1 if Vj > 0.5 and Sj — otherwise, j = I, . . . , N 



Figure 3: Implementation details of the mean field algorithm for set covering problems. 
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Appendix C. Benchmark Results 



Problem 


Rows 


columns 


Density 


Number of ones per 


Number of 


set 


(M) 


(N) 


(%) 


row [Max-Min-Ave] 


Problems 


4 


200 


1000 


2 


36-8-20 


10 


5 


T> 


2000 


2 


60-21-40 


55 


6 


?? 


1000 


5 


71-29-50 


5 


A 


300 


3000 


2 


81-38-60 




5 


B 


?? 


55 


5 


191-114-150 




5 


C 


400 


4000 


2 


105-56-80 




5 


D 


j; 


II 


5 


244-159-200 




5 


NRE 


500 


5000 


10 


561-444-499 




5 


NRF 


II 


V 


20 


1086-914-999 




5 


NRG 


1000 


10000 


2 


258-153-199 




5 


NRH 


;; 


57 


5 


580-436-499 




5 


Rail507 


507 


63009 


1.3 


7753-1-807 




L 


Rail516 


516 


47311 


1.3 


7805-1-610 




! 


Rail582 


582 


55515 


1.2 


8919-1-690 




5 


Rail2536 


2536 


1081841 


0.40 


86666-1-4335 






Rail2586 


2586 


920683 


0.34 


72553-1-3097 






Rail4284 


4284 


1092610 


0.24 


56181-1-2633 






Rail4872 


4872 


968672 


0.20 


69708-1-1897 




; 


E 


50 


500 


20 


124-77-100 


5 


CYC6 


240 


192 


2.1 


4-4-4 


1 


CYC7 


672 


448 


0.9 


JJ 






CYC8 


1792 


1024 


0.4 


55 






CYC9 


4608 


2304 


0.2 


55 






CYCIO 


11520 


5120 


0.08 


;5 






CYC 11 


28160 


11264 


0.04 


55 






CRLIO 


511 


210 


12 


126-10-26 






CRLll 


1023 


330 


;; 


210-20-41 






CRL12 


2047 


495 


» 


330-30-62 






CRL13 


4095 


715 


55 


495-50-89 






STS45 


330 


45 


6.7 


3-3-3 






STS81 


1080 


81 


3.7 


5! 






STS135 


3015 


135 


2.2 


55 






STS243 


9801 


243 


1.2 


55 







Table CI: Test problem details. The density refers to the percentage of ones in the A matrix. The 
maximum, minimum and avearage of ones per row is taken over all problems in each set. E - STS are 
unicost problems. 
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Problem Optimal value 



Best MF 

solution in 

10 trials 



Average MF 

solution over 

10 trials 



Solution time 



4.1 

4.2 

4.3 

4.4 

4.5 

4.6 

4.7 

4.8 

4.9 

4.10 

5.1 

5.2 

5.3 

5.4 

5.5 

5.6 

5.7 

5.8 

5.9 

5.10 

6.1 

6.2 

6.3 

6.4 

6.5 

A.l 

A.2 

A.3 

A.4 

A.5 

B.l 

B.2 

B.3 

B.4 

B.5 

C.l 

C.2 

C.3 

C.4 

C.5 

D.l 

D.2 

D.3 

D.4 

D.5 



429 


435 


435.6 


0.44 


512 


517 


518.0 


0.49 


516 


531 


532.7 


0.45 


494 


512 


520.9 


0.48 


512 


522 


524.1 


0.46 


560 


566 


567.8 


0.44 


430 


446 


446.0 


0.45 


492 


492 


493.8 


0.46 


641 


658 


661.4 


0.47 


514 


521 


521.0 


0.45 


253 


260 


268.6 


0.87 


302 


316 


316.0 


0.83 


226 


229 


229.0 


0.86 


242 


247 


247.5 


0.86 


211 


214 


214.3 


0.83 


213 


213 


213.2 


0.82 


293 


304 


305.0 


0.86 


288 


299 


300.1 


0.90 


279 


281 


281.0 


0.82 


265 


273 


274.0 


0.83 


138 


143 


143.0 


0.63 


146 


153 


153.2 


0.62 


145 


150 


150.2 


0.60 


131 


132 


133.1 


0.62 


161 


169 


169.8 


0.62 


253 


260 


261.5 


1.5 


252 


257 


258.3 


1.5 


232 


238 


241.3 


1.5 


234 


238 


239.7 


1.5 


236 


238 


238.9 


1.4 


69 


70 


71.2 


2.2 


76 


77 


77.6 


2.3 


80 


83 


83.7 


2.2 


79 


80 


80.0 


2.3 


72 


72 


72.0 


2.3 


227 


233 


233.6 


2.2 


219 


222 


224.3 


2.2 


243 


249 


251.1 


2.3 


219 


220 


220.1 


2.3 


215 


219 


219.1 


2.2 


60 


64 


64.6 


3.7 


66 


66 


66.3 


3.8 


72 


73 


75.1 


3.9 


62 


63 


63.0 


3.8 


61 


64 


64.6 


3.8 



Table C2: Results for problems 4 ~ D (see Table |C|). Solution time refers 400 MHz Pentium II CPU 
seconds and includes the prerun for non-unicost problems. 
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Problem 



Current 
best value 



Best MF 

solution in 

10 trials 



Average MF 

solution over 

10 trials 



Solution time 



NRE.l 


29 


29 


29.5 


9.8 


NRE.2 


30 


32 


32.1 


9.8 


NRE.3 


27 


28 


28.2 


9.7 


NRE.4 


28 


29 


29.7 


9.8 


NRE.5 


28 


29 


29.0 


9.8 


NRF.l 


14 


14 


14.9 


19 


NRF.2 


15 


15 


15.4 


18 


NRF.3 


14 


15 


15.2 


19 


NRF.4 


14 


15 


15.4 


19 


NRF.5 


13 


14 


14.7 


19 


NRG.l 


176 


180 


180.1 


10 


NRG. 2 


155 


157 


159.0 


10 


NRG.3 


166 


173 


174.9 


10 


NRG.4 


168 


175 


176.3 


10 


NRG.5 


168 


175 


176.9 


11 


NRH.l 


64 


65 


66.4 


21 


NRH.2 


64 


66 


67.0 


21 


NRH.3 


59 


62 


62.8 


20 


NRH.4 


58 


60 


61.8 


21 


NRH.5 


55 


56 


56.4 


21 


Rail507 


174 


187 


188.2 


37 


Rail516 


211 


186 


187.9 


26 


Rail582 


182 


222 


225.5 


32 


Rail2536 


691 


737 


740.0 


1100 


Rail2586 


951 


1018 


1026.7 


830 


Rail4284 


1065 


1152 


1162.1 


1100 


Rail4872 


1534 


1640 


1643.5 


1050 



Table C3: Results for problems NRE - Rail (see Table |Cl|). Solution time refers 400 MHz Pentium II 
CPU seconds and includes the prerun for non-unicost problems. The notation current best value may 
for some instances mean optimal value. 
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Problem 


Current 


Best MF 


Average MF 


Solution time 




best value 


solution in 
10 trials 


solution over 
10 trials 




E.l 


5 


5 


5.3 


0.15 


E.2 


5 


5 


5.0 


0.14 


E.3 


5 


5 


5.0 


0.15 


E.4 


5 


5 


5.0 


0.14 


E.5 


5 


5 


5.0 


0.16 


CYC.6 


60 


62 


63.0 


0.08 


CYC.7 


144 


151 


153.4 


0.20 


CYC.8 


344 


348 


352.1 


0.62 


CYC.9 


780 


829 


832.6 


1.6 


CYC.IO 


1792 


1870 


1882.3 


3.9 


CYC.ll 


4103 


4240 


4248.7 


9.6 


CLR.IO 


25 


27 


29.0 


0.36 


CLR.ll 


23 


26 


28.9 


1.0 


CLR.12 


26 


30 


30.9 


3.2 


CLR.13 


26 


31 


32.9 


10 


STS.45 


30 


31 


31.8 


0.03 


STS.81 


61 


63 


63.9 


0.11 


STS.135 


104 


105 


107.4 


0.32 


STS.243 


202 


205 


206.8 


1.1 



Table C4: Results for the unicost problems E - STS (see Table |C1|). Solution time refers 400 MHz 
Pentium II CPU seconds. The notation current best value may for some instances mean optimal value. 
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Appendix D. WWW Server 

A program executing the mean field algorithm for set covering problems as presented in this paper can 
be publicly used by means of a World Wide Web server. A user can interactively submit a file defining 
an instance of SCP, and obtain the found solution. The URL of the WWW server is: 



tittp : //www . thep . lu . se/complex/mf .server . html 



An instance of SCP is defined by specifying the costs c,; and the matrix A. Two formats, row and 
column ordering, are supported for the file that lists Ci and the none-zero entries of A; they are defined 
as follows: 

Row Ordering 



M N 

C1C2 . . .cn 

Ml "space separated list of all non-zero entries for row 1" 

M2 "space separated list of all non-zero entries for row 2" 

Mm "space separated list of all non-zero entries for row M" 



Column Ordering 



M N 

Ci Ni "space separated list of all non-zero entries for column 1" 

C2 N2 "space separated list of all non-zero entries for column 2" 

Cn Nn "space separated list of all non-zero entries for column N" 



The column sums Ni and row sums Mk are defined in Eqs. (0^). As an example, consider the SCP 
instance defined by 

/ 1 1 1 \ 

c= (1,2,3,4,5), A= J } J J (Dl) 

V 1 1 1 / 

for which the column and row ordering formats read: 

Row Ordering Column Ordering 

4 5 4 5 

12 3 4 5 12 13 

3 13 5 2 2 2 3 

2 2 4 3 2 14 

3 12 5 4 2 2 4 
3345 53134 
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The solver returns the found cost (energy) E (Eq.(H)), together with a characterization of the problem. 
Upon request, a file that lists the columns used in the solution is also provided. 

There is a limitation [B) on the size of the problems that can be submitted to the server. Instances of 
SCP with MNp > B will not be considered. Presently B, which is limited by the available memory of 
the server, is given by 3a;10^. 
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